Towards a Knowledge Graph based Speech Interface

نویسندگان

  • Ashwini Jaya Kumar
  • Sören Auer
  • Christoph Schmidt
  • Joachim Köhler
چکیده

Applications which use human speech as an input require a speech interface with high recognition accuracy. The words or phrases in the recognized text are annotated with a machineunderstandable meaning and linked to knowledge graphs for further processing by the target application. This type of knowledge representation facilitates to use speech interfaces with any spoken input application, since the information is represented in logical, semantic form., retrieving and storing can be followed using any web standard query languages. In this work, we develop a methodology for linking speech input to knowledge graphs. We show that for a corpus with lower WER, the annotation and linking of entities to the DBpedia knowledge graph is considerable. DBpedia Spotlight, a tool to interlink text documents with the linked open data is used to link the speech recognition output to the DBpedia knowledge graph. Such a knowledge-based speech recognition interface is useful for applications such as question answering or spoken dialog systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Two-level speech recognition to enhance the performance of spoken dialogue systems

Spoken dialogue systems can be considered knowledge-based systems designed to interact with users using speech in order to provide information or carry out simple tasks. Current systems are restricted to well-known domains that provide knowledge about the words and sentences the users will likely utter. Basically, these systems rely on an input interface comprised of speech recogniser and seman...

متن کامل

An Acoustic Study of Emotivity-Prosody Interface in Persian Speech Using the Tilt Model

This paper aims to explore some acoustic properties (i.e. duration and pitch amplitude of speech) associated with three different emotions: anger, sadness and joy against neutrality as a reference point, all being intentionally expressed by six Persian speakers. The primary purpose of this study is to find out if there is any correspondence between the given emotions and prosody patterning in P...

متن کامل

Towards Multimodal Dialogue Management

abstract EEective dialogue management is a key issue in speech-based interfaces to information systems since it can ensure a cooperative interaction with the user. Cooperativeness requires techniques which allow the user to eeeciently access information and also techniques which compensate for limitations in system knowledge and speech technology. The paper describes management techniques devel...

متن کامل

P65: Speech Recognition Based on Bbrain Signals by the Quantum Support Vector Machine for Inflammatory Patient ALS

People communicate with each other by exchanging verbal and visual expressions. However, paralyzed patients with various neurological diseases such as amyotrophic lateral sclerosis and cerebral ischemia have difficulties in daily communications because they cannot control their body voluntarily. In this context, brain-computer interface (BCI) has been studied as a tool of communication for thes...

متن کامل

Towards best practices for speech user interface design

Designing speech interfaces is difficult. Research on spoken language systems and commercial application development has created a body of speech interface design knowledge. However, this knowledge is not easily accessible to practitioners. Few experts understand both speech recognition and human factors well enough to avoid the pitfalls of speech interface design. To facilitate the design of b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1705.09222  شماره 

صفحات  -

تاریخ انتشار 2017